Privacy Preserving ID3 over Horizontally, Vertically and Grid Partitioned Data

نویسندگان

  • Bart Kuijpers
  • Vanessa Lemmens
  • Bart Moelans
  • Karl Tuyls
چکیده

We consider privacy preserving decision tree induction via ID3 in the case where the training data is horizontally or vertically distributed. Furthermore, we consider the same problem in the case where the data is both horizontally and vertically distributed, a situation we refer to as grid partitioned data. We give an algorithm for privacy preserving ID3 over horizontally partitioned data involving more than two parties. For grid partitioned data, we discuss two different evaluation methods for preserving privacy ID3, namely, first merging horizontally and developing vertically or first merging vertically and next developing horizontally. Next to introducing privacy preserving data mining over grid-partitioned data, the main contribution of this paper is that we show, by means of a complexity analysis that the former evaluation method is the more efficient.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Privacy in Multi-Agent Learning: Securely Inducing a Multi-Agent Decision Tree

In this paper we study the problem of how multiple distributed agents can jointly induce a decision tree, such that each of them preserves privacy over its own data and item set and each of them holds a part of the learned tree. Our work is original in the following ways: First of all, we consider agents to maintain data sites and jointly induce a decision tree. This merely reflects reality, as...

متن کامل

Privacy-Preserving Classification and Clustering Using Secure Multi-Party Computation

Nowadays, data mining and machine learning techniques are widely used in electronic applications in different areas such as e-government, e-health, e-business, and so on. One major and very crucial issue in these type of systems, which are normally distributed among two or more parties and are dealing with sensitive data, is preserving the privacy of individual’s sensitive information. Each par...

متن کامل

SMC Protocol for Naïve Bayes Classification over Grid Partitioned Data using Multiple UTPs

The case where data is distributed horizontally as well as vertically, it refers as grid partitioned data. SMC protocol for Naïve Bayes classification over grid partitioned data is offered in this paper. Also present a solution of the Secure Multi-party Computation (SMC) problem in the form of a protocol that preserves privacy. In this system, a protocol with several Un-trusted Third Parties (U...

متن کامل

Privacy-preserving algorithms for distributed mining of frequent itemsets

Standard algorithms for association rule mining are based on identification of frequent itemsets. In this paper, we study how to maintain privacy in distributed mining of frequent itemsets. That is, we study how two (or more) parties can find frequent itemsets in a distributed database without revealing each party’s portion of the data to the other. The existing solution for vertically partitio...

متن کامل

Analysis of Privacy Preserving Clustering Approach over Horizontally Partitioned Data

Data mining is the most current topic in research area. From a very long time we are working on this topic that is how we can secure our database. There are many problems which are associated with this topic like data missing, data lost, hence we explain some approaches like hierarichal clustering, homomorphic encryption, k-means clustering and SMC by these techniques database which is horizont...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/0803.1555  شماره 

صفحات  -

تاریخ انتشار 2008